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@ Diagnostic reagent for hepatitis C. 

@ A diagnostic reagent for hepatitis C, which detects an antibody induced by Infection of hepatitis C virus, 
characterized In that said diagnostic reagent comprises the second envelope protein or the first non-structural 
protein which is encoded by the gene of hepatitis C virus and has a suger chain. This invention also provide a 
method for detecting an anti-hepatitis C virus antibody. The use of the diagnostic reagent for hepatitis C 
according to the present invention makes highly sensitive diagnosis of hepatitis C possible. 
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BACKGROUND OF THE INVENTION 

This invention relates to a diagnostic reagent for hepatitis C comprising an antigen protein translated 
from a genome of hepatitis C virus. More specifically, this invention relates to a diagnostic reagent for 
■"^ 5 detecting an antibody against hepatitis C virus (hereinafter referred to as "HCV"). which comprises a protein 
encoded by a gene of HCV, wherein said protein is Identified as a glycoprotein called the second envelope 
protein or the first non-structural protein (hereinafter referred to as "E2/NS1 

The first successful cloning of human hepatitis virus which had been called non-A. non-B hepatits virus 
was accomplished in 1988 by Chiron Co.. Ltd. U.S.A and the hepatitis virus was designated HCV. Further. 
70 Chiron Co., Ltd. succeeded in expressing in a yeast a fused protein which comprises at the C-terminal the 
polypeptide corresponding to the region having 363 amino acid residues from the third nonstructural protein 
(NS3) to the forth non-structural protein (NS4) both of which are portions of nonstructural proteins of HCV 
and at the N-terminal human superoxide dismutase(European unexamined patent publication No. 318216) 
and, using this recombinant antigen, developed a diagnostic reagent for hepatitis C (Science, 244, 359-362. 
75 362-364, (1989)). 

In Japan, the Japanese Red Cross Society has been using the diagnostic reagent in the screening of 
blood provided by donors, which is known as "CI 00-3 antibody test", in order to avoid post-transfusion 
hepatitis since the end of 1989. However, since not all samples are effectively screened only by ClOO-3 
antibody test, post-transfusion hepatitis is not completely avoided. 

20 Subsequently, further investigation of HCV genomes derived from the serum of a Japanese patient by 
the cloning technique revealed that HCV prevailed in Japan is similar to HCV obtained by Chiron Co., Ltd. 
but a different strain (Protein, Nucleic acid and Enzyme,36, 1679-1691, (1991)). In addition, the use of the 
core protein (C) region of the structural protein, the third non-structural protein (NS3) region, the fifth non- 
structural protein region and the like have been proposed as more effective diagnostic reagents than C100- 

25 3 (Lancet, 337, 317-319, 1991 and Japanese unexamined patent publication (hereinafter referred to as "J. 
P. KOKAr)"No. Hei 3-103180). 

The CI 00-3 antibody test system has a disadvantage that the detection rate and the sensitivity are low 
as mentioned above. Although proteins derived from C, NS3 and NS5 regions have been proposed as more 
effective antigens for detection than C100-3, any satisfactory results have not yet been reported. Therefore, 

30 there is a need for a diagnostic reagent and a diagnostic method for hepatitis C, having a higher detection 
rate and sensitivity. 

SUMMARY OF THE INVENTION 

35 The inventors have conducted various Investigations to obtain a diagnostic reagent for hepatitis C, 
having a higher detection rate and sensitivity. As a result, they have found that E2/NS1 protein having a 
suger chain, which is obtained by expressing cDNA of E2/NS1 region in animal cells reacts with the serum 
of the patient of hepatitis C with a high rate in a fluorescent antibody test and accomplished the goals of the 
present Invention. The high reaction rate of E2/NS1 region with the serum of the patient of hepatitis C was 

40 unexpected because the protein derived from E2/NS1 region is susceptible to the mutation of an amino acid 
sequence and, therefore, the protein expressed in E.coli has been considered to react with the serum of the 
patient of hepatitis C with a lower rate comparing with the proteins derived from the other regions of HCV 
and it has not been expected to use the protein for a diagnostic reagent. 

The present Invention provides a diagnostic reagent for hepatitis C, which detects an antibody Induced 

45 by infection of hepatitis C virus, characterised in that said diagnostic reagent comprises the second 
envelope protein or the first non-structural protein which is encoded by the genome of hepatitis C virus and 
has a suger chain. 

BRIEF DESCRIPTION OF THE DRAWINGS 

50 

Fig. 1 shows the steps of constructing DNA fragment 1325SK containing the base sequence of clone 
J1-1325. 

Fig. 2 shows the steps of constructing plasmid pSR316EP. 
Fig. 3 shows the steps of constructing plasmid pSRNot. 
55 Rg. 4 shows the steps of constructing expression vector paSR1325X-3 having a DNA fragment coding 
for E2/NS1 protein. 

Rg. 5 shows th steps of constructing plasmid pHLpI . 
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Rg. 6 shows the steps of constructing expression vector mulcos pHL16SR1325 having 16 DNA 
fragments coding for E2/NS1 protein. 

DETAILED EXPLANATION OF THE INVENTION 

5 

E2/NS1 protein of the present invention is a protein derived from the region called the second envelope 
protein or the first nonstructural protein, which is encoded by the genome of HCV. Examples of the proteins 
are illustrated in SEQUENCE ID Nos.1-12 in SUQUENCE LISTING. Proteins obtained from such proteins by 
deleting, inserting, modifying or adding a part of amino acids are encompassed In the scope of the present 
JO invention provided that they maintain the reactivity with the serum of the patient of hepatitis 0. 

(1) Method of preparing clones of cDNA derived from the serum of the patient of hepatitis C, which are 
shown In SEQUENCE ID Nos. 1-3 of SEQUENCE LISTING and determining the base sequence thereof 

75 Genes or DNA fragments coding for novel polypeptides, which are shown in SEQUENCE ID Nos. 1-3 of 
SEQUENCE LISTING can be prepared, for example, by a method described below. 

Since there exists a trace of HCV in the serum and the genome of HCV Is expected to be RNA, it was 
expected that cloning by Okayama-Berg method or Gubler-Hoffman method of the prior art would be 
attended by difficulties and, therefore, the following method was conducted to ensure the cloning of the 

20 gene susceptible to mutation from a trace of the serum. 

The nucleic acid is extracted from the serum of the patient of hepatitis C as described in Example 1 
later. Generally, it is preferred to use the serum having an OD value of 3.5 or more measured by a test kit 
of Ortho Inc. However, the present invention is not limited to the use of the serum having such an OD value. 
The serum is preferably mixed with transfer RNA (tRNA) as a carrier of virus RNA. The carrier is not limited 

25 to tRNA. Any polyribonucleoside can be used as carriers. If tRNA is used, there is an advantage that it can 
be rapidly confirmed by electrophoresis whether there is a required amount of tRNA having an intact length. 
By this confimation, it can also be confirmed whether virus RNA degradates after being mixed with tRNA as 
a carrier of virus RNA. As a technique of cloning cDNA from the nucleic acid, it is preferred to use 
polymerase chain reaction method developed by Saiki et al. (PCR method, Nature, 324, 126, (1986)). First 

30 of all, a reverse transcriptase is reacted using virus RNA as a template. In the reaction, any commercially 
available random primers or synthesized DNA having a base sequence similar to that of primer AS1 which 
is shown below may be used as a primer. 

5* 3' 

35 

AS1;GCTATCAGCAGCATCATCCA SEQUENCE ID No. 13 

A few bases at the 5' end of these sequences may be changed to other bases. Preferably, a few bases 
40 within 10 bases from the 5' end and more preferably, a few bases within 5 bases from the 5' end may be 
changed to other bases. In addition, 4-5 bases, preferably a few bases may be deleted from the sequences 
at the 5' end of these sequences. Furthermore, any 8-12 bases, preferably 5-6 bases, more preferably a 
few bases, may be added to the sequences at the 5' end of these sequences. 

PCR method is specifically carried out under the conditions described in Example 1 . PCR method is 
45 carried out as described in Example 1 using the first complementary DNA (1st cDNA) thus obtained as a 
template to prepare a desired DNA fragment. The conditions of PCR method are suitably selected 
depending on the cicumstances. Representative examples of sense primers include the following one: 



50 5 » 3 • 

S1:CAGITAITCCGGATCCCICAAG SEQUENCE ID No. 14 

55 "1" appearing in the sequenc means inosine. A few bases at the 5' end of these sequences may be 
changed to other bas s. Preferably, a few bases within 10 bases, more preferably, within 5 bases from the 
5* end may b changed to oth r bases. In addition. 4-5 bases, preferably a few bases may be deleted from 
the sequences at the 5' end of these sequences. Furthermore, any 8-12 bases, preferably 5-6 bases, more 

3 
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preferably a few bases may b added to the sequences at the 5' nd of these sequenc s. 

The DNA fragment thus obtained is inserted at one of cloning sites such as Sma I site of a cloning 
vector such as pUC19 according to conventional technique. Using a plasmid having this DNA fragment, the 
base sequences of at least 3 clones are determined independently regarding the both strands. The 

5 determination of the base sequences can be easily carried out by a dideoxy method using, for example, 7- 
deaza sequence kit availble from Takara Shuzo Co., Ltd. or fluorescence sequencer GENESIS 2000 system 
available from Du Pont according to the protocol thereof. When the DNA fragment has a site which Is 
considered difficult to determine the base sequence or has more than about 180 base pairs, a subcloning 
may be carried out according to conventional technique. SEQUENCE ID Nos.1-3 of SEQUENCE LISTING 

10 show the amino acid sequences of the proteins assumed from the base sequences of the DNA fragments 
thus determined. 

Clone J1-1325 (SEQUENCE ID No.1), clone N27. clone N19, H19 and Y19 (SEQUENCE ID No. 3) were 
prepared with the serums of different patients. Clone MX24 (SEQUENCE ID No.3) was prepared with a pool 
of the serums of the patients of hepatitis C. The clones shown in SEQUENCE ID Nos.1-3, which were 
75 prepared using a combination of primer S1 with primer AS1 correspond to the same region in the gene of 
HCV. 

Antigen proteins derived from E2/NS1 protein regions shown in SEQUENCE ID Nos.4-12 of SE- 
QUENCE LISTING can also be used in the present invention. 

The antigen protein of SEQUENCE ID No.4 can be obtained by expressing cDNA described in Journal 

20 of Virology, 65, 1105-1113, (1991). The antigen protein of SEQUENCE ID No.5 can be obtained by 
expressing cDNA described in Proceedings of the National Academy of Sciences of the USA, 87, 9524- 
9528, (1990). The antigen protein of SEQUENCE ID No.6 can be obtained by expressing cDNA described 
in The fiftieth general meeting of Japanese Cancer Society, 379, (1991). The antigen protein of SEQUENCE 
ID No.7 can be obtained by expressing cDNA described in European Patent No.0,388,232 (1990). The 

25 antigen proteins of SEQUENCE ID Nos.8 and 9 can be obtained by expressing cDNAs described in 
Proceedings of the National Academy of Sciences of the USA, 08^ 3392-3396, (1991). The antigen proteins 
of SEQUENCE ID Nos.10 and 11 can be obtained by expressing cDNAs described in Japanese Journal of 
Experimental Medicine, 60, 167-177, (1990). The antigen protein of SEQUENCE ID No.12 can be obtained 
by expressing cDNA described in Biochemical and Biophysical Research Communications, 175 , 220-228, 

30 (1991). The sequences shown in SEQUENCE ID Nos.4-12 correspond to the same region as that of the 
sequences shown in SEQUENCE ID Nos.1-3. (2) Expression of polypeptides encoded by the clones 
prepared in step (1) 

In order to produce E2/NS1 protein, it is necessary to select an appropriate host-vector system which is 
able to stably express the protein. Further, it is required that the expressed E2/NS1 protein has the same 

35 level of biological activity, that is, antigenicity as that of HCV. Considering that natural E2/NS1 protein is 
expected to be a glycoprotein and that E2/NS1 protein contains many cysteine residues and the positions 
of the thiol bonds between the cysteine residues and the higher-order structure of the protein are important 
to maintain the activity, it is desired to express the protein in such an animal cell host as CHO cell, COS 
cell, mouse L cell, mouse CI 27 cell and mouse FM3A cell, preferably CHO cell. When these cells are used 

40 as hosts, it is expected that processed E2/NS1 protein is produced by introducing E2/NS1 gene having a 
signal-like sequence of from the 32 position to the 44 position of the amino acid sequences shown in 
SEQUENCE ID Nos.1-12 into the cell. Expression plasmids for these animal host cells can be constructed 
as follows: 

As promoters in the animal cells, one can use the active-type promoter of adenovirus EIA gene 
45 (Biochemical Experiment Lecture, second series. Vol. 1, Techniques for gene investigations II, 189-190 
(1986)). the early promoter of SV40. the late promoter of SV40, the promoter of apolipoprotein E gene and 
SR a promoter (Molecular and Celluar Biology, 8, 466-472, (1988)). preferably the promoter of SV40 and 
SR a promoter. 

A DNA fragment of a gene coding for E2/NS1 protein containing the signal-like sequence is inserted 
so downstream of the promoter in a direction of the transcription. When the expression vector of E2/NS1 
protein is constructed, a ligated gene fragment of at least two gene fragments coding for E2/NS1 protein 
may be inserted downstream of the promoter. At least two units of DNA fragments ligated upstream of the 
5' end of the D NA fragment of the gene coding for E2/NS1 protein with such a promoter as that of SV40 
may be ligated together in the same direction of the transcription and then inserted in the vector. 
55 Polyadenylation sequence is required to be present downstream of the gene coding for E2/NS1 protein. For 
example, at least one of polyadenylation sequences derived from SV40 gene, ^-globin gene or metal- 
lothionein gene is required to be pr s nt downstream of th g ne coding for E2/NS1 prot in. When at least 
two of the DNA fragments containing the gene coding for E2/NS1 protein ligated to the promoter are 
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ligated, th polyadenylation s quenc may be pr s nt at ach 3' end of the gen coding for E2/NS1 
protein. 

In transforming an animal cell such as CHO cell with this expression vector, the use of a selective 
marker is desired. Examples of the selective markers include DHFR gene expressing methotrexate 

5 resistance (Journal of Molecular Biology. 159, 601, (1982)), Neo gene expressing antibiotic G-418 resistance 
(Journal of Molecular Applied Genetics, 1^ 327, (1982)), Ecogpt gene derived from E. coli, expressing 
mycophenol acid resistance (Proceedings~~of the National Academy of Sciences of the USA, 78, 2072, 
(1981)), hph gene expressing antibiotic hygromycin resistance (Molecular and Celluar Biology, 5, 410. 
(1985)) and the like. A promoter such as the aforementioned promoter derived from SV40 and the promoter 

10 of TK gene of Herpes virus is inserted upstream of the 5' end of each drug resistance gene. The 
aforementioned polyadenylation sequence are contained downstream of the 3* end of each drug resistance 
gene. When such a drug resistance gene is inserted in the expression vector of E2/NS1 protein, it may be 
inserted downstream of the polyadenylated site in the gene coding for E2/NS1 protein in a right direction or 
a reverse direction. These expression vectors do not require any co-transfection with another plasmid 

75 containing a selective marker gene in preparing a transfect. 

In the case where such a selective marker gene is not inserted in the expression vector of E2/NS1 
protein, a vector having a selective marker of the transfect, such as pSV2neo (Journal of Molecular Applied 
Genetics. 1, 327, (1982)), pMBG (Nature, 294, 228, (1981)). pSV2gpt (Proceedings of the National Academy 
of Sciences of the USA, 78, 2072. (1981))TlpAd-D26-1 (Journal of Molecular Biology, 159, 601, (1982)) and 

20 the like may be used together with the expression vector of E2/NS1 protein to conduct co-transfection. The 
transfect can be easily selected by gene expression of the selective marker gene. 

Examples of methods of introducing the expression vector into the animal cell include calcium 
phosphate method (Virology. 52, 456, (1973)) and electroporation method (Journal of Membrane Biology, 
10, 279. (1972)). Calcium phos^ate method is used in general. 

25 The transfected animal cell can be cultured by a float culture or an adherent culture in the conventional 
manner. The cultivation can be conducted in a medium such as MEM. Ham, F-12 and the like in the 
presence of 5-10 % of serum or a suitable amount of insulin, dexamethasone and transferrin or in the 
absence of serum. The animal cell expressing E2/NS1 protein can be detected by fluorescent antibody 
technique using the serum of the patient according to the conventional method. The cloning is carried out 

30 by limiting dilution according to the conventional method to establish a cell line stably producing E2/NS1 
protein, 

E2/NS1 protein derived from HCV gene, thus obtained can be used as HCV antigen which reacts 
immunologically with the serum containing HCV antibody and therefore, is useful for the confirmation or the 
detection of the presence of Anti-HCV antibody in samples including blood or serum. Examples of the 

35 immunoassays include RIA (radioimmunoassay). ELISA (enzyme-linked immunoadosorbent assay), fluores- 
cent antibody technique, agglutination reaction including latex fixation, immune precipitation and the like. In 
the detection, a labelled antibody is usually used. A labelling substance such as a fluorescent substance, a 
chemoluminescent substance, a radioactive substance, a dyeing substance and the like can be used. 
Accordingly, using the above E2/NS1 protein derived from HCV gene as an antigen, the diagnostic reagent 

40 for hepatitis C according to the present invention can be prepared. 

The reagent containing the protein having a sugar chain, which is derived from E2/NS1 region 
according to the present invention makes the confirmation or the detection of the presence of anti-HCV 
antibody in samples including blood or serum possible. The use of the reagent according to the present 
invention makes highly sensitive diagnosis of hepatitis C possible. 

45 The present invention will be explained in more detail with reference to the following non-limiting 
examples. 

Example 1 

50 (1) Extraction of the nucleic acid from the serum of the patient of hepatitis C 

Twenty-five milliliters of a Tris buffer (50 mM Tris-HCI (pH 8.0), 1 mM EDTA and 100 mM NaCI) were 
added to 10 ml of the serum of the patient of hepatitis 0, which showed at least 3.5 of an OD value by a 
HCV EIA kit available from Ortho Inc. After being mixed, the mixture was centrifuged at 20,000 x g at 20 ' C 
55 for 20 minutes. The obtained supernatant was centrifuged at 100.000 x g at 20 'C for additional 5 hours. 
One point fiv milliliters of a Prot nase K solution (1% sodium dod cyl sulfate. 10 mM EDTA, lOmM Tris- 
HCI (pH 7.5), 2 mg/ml Protenase K (availabi from Pharmacia Co.) and 6.6 u g of a yeast tRNA mixtur ) 
were added to the precipitate. After the precipitate was dissolved in the Protenase K solution, the obtained 
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solution was maintained at 45 'C for 90 minutes. Th mixture was subjected at least four tim s to a 
phenol/chloroform treatment which comprises the steps of adding an equivalent amount of 
phenol/chloroform, violently agitating and then c ntrifuging the mixture to coll ct an aqueous phase 
containing a nucleic acid. Then, a chloroform treatment was carried out at least 2 times. To the obtained 
* 5 aqueous phase, one-tenth amount of 3M sodium acetate or an equivalent amount of 4M ammonium acetate, 
and 2.5-fold volume of ethanol were added and the mixture was left to stand at -20 * C overnight or -80 * C 
for at least 15 minutes. The mixture was centrifuged at 35,000 rpm for 4 hours by a SW41Ti rotor (available 
from Beckmann Co.) to collect a nucleic acid as a precipitate. 

10 (2) Synthesis of cDNA 

(2-1) Synthesis of an RNA sample 

After the nucleic acid obtained in step (1) was dried, 30 u I of water and 10 u I of ribonuclease inhibitor 
15 (100 units/ u I, available from Takara Shuzo Co.. Ltd.) were added thereto to dissolve the nucleic acid. The 
following synthesis of cDNA was carried out using the obtained nucleic acid solution. 

(2-2) Synthesis of cDNA using an anti-sense primer 

20 To 2 u I of the aqueous solution of the nucleic acid prepared in step (2-1), 1 u I of an anti-sense primer 
(synthesized DNA primer AS1 ; 15 pmoles/ u I), 2 u I of lOxRT buffer (lOOmM Tris-HCI (pH 8.3) and 500 
mM of KCI), 4 u I of 25 mM MgCl2. 8 u I of 2.5 mM 4dNTP and 1 u I of water were added and the mixture 
was maintained at 65*C for 5 minutes and at room temperature for 5 minutes. Subsequently, 1 u I of 25 
units of a reverse transcriptase (available from Life Science Co.) and 1 u I of a ribonuclease inhibitor (100 

25 units/ u I, available from Takara Shuzo Co., Ltd.) were added to the mixture and then the resulting mixture 
was maintained at 37 • C for 20 minutes, then at 42 • C for 30 minutes and finally at 95 • C for 2 minutes. 
Immediately thereafter, the mixture was cooled to O'C (Synthesis of complementary DNA). The DNA 
having a specific sequence was amplified using 10 u I of the DNA sample according to Saiki's method 
(Nature, 324, 126, (1986)), so-called PCR method as follows: 

30 Water was added to a mixture of 10 u I of the above DNA sample, 10 u I of lOxPCR buffer (100 mM of 
Tris-HCI (pH 8.3), 500 mM of KCI, 15 mM of MgCb , and 1 % of gelatin). 8 U I of 2.5 mM 4dNTP, 2 u I of 
the synthesized DNA primer used in the synthesis of the complementary DNA (150 pmoles/ u I), 3 u I of a 
synthesized DNA primer corresponding to the DNA primer (15 pmoles/ u I) (which is complementary to the 
synthesized DNA primer used In the synthesis of the complementary DNA, i.e., the aforementioned primer 

35 SI) to prepare 100 u I of an aqueous solution. After the solution was maintained at 95*C for 5 minutes. It 
was cooled rapidly to O'C. One minute after the cooling, the solution was mixed with 0.5 u I of Taq DNA 
polymerase (7 units/ u I, Trade Name "AmpliTaqTM" available from Takara Shuzo Co., Ltd.) and then 
mineral oil was layered on the mixture. This sample was incubated on a DNA Thermal Cycler available from 
Parkin Elmer Cetus Co. at 95 "C for 1 minute, at 40-55 'C for 1 minute, and at 72'C for 1-5 minutes for 

40 25 cycles. After the sample was incubated finally at 72 * C for 7 minutes, the reaction aqueous solution was 
subjected to a phenol/chloroform treatment and a precipitation treatment with ethanol to obtain amplified 
DNA fragments. The above precipitation treatment with ethanol was carried out by mixing the aqueous 
phase with a one-tenth amount of 3 M sodium acetate or an equivalent amount of 4 M ammonium acetate 
together with a 2.5-fold volume of ethanol, centrlfuging the mixture at 15.000 rpm at 4' C for15 minutes by 

45 a rotor having a radius of about 5 cm and drying the precipitate. 

(3) Cloning of the amplified DNA fragments and Determination of the base sequences thereof 

At least 1 pmole of the DNA fragments obtained by the method described in step (2-2) was treated with 
50 T4 DNA polymerase (available from TOYOBO CO.,LTD) to make blunt ends (Molecular Cloning, 1982, Cold 
Spring Harbor Laboratory Press). After a phosphoric acid group was introduced into the DNA fragment at 
the 5' end with polynucleotidekinase (available from TOYOBO CO..LTD) (Molecular Cloning, 1982, Cold 
Spring Harbor Laboratory Press), the DNA fragment was Inserted at Sma I site present in the multicloning 
sites of pUC19 cloning vector using a ligation kit (available from Takara Shuzo Co.. Ltd.). 
55 The vector DNA prepared In the following procedur was used In the ligation in an amount of 5-10 ng. 

pUC18 cloning vector was cleaved with restriction enzyme Sma I (available from TOYOBO CO.,LTD) 
and then subjected to a phenol/chloroform treatment and a precipitaion treatm nt with ethanol. Subse- 
quently, this was treated with alkaline phosphatase (available from Boehrlnger Mannheim) to conduct the 
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dephosphorylation at th 5* end (Molecular Cloning. 1982, Cold Spring Harbor Laboratory Press), followed 
by a phenol/chloroform treatment and a precipitation with ethanol. The competent c II of E.coli Jf^109 or 
DH5 (available from TOYOBO CO.,LTD) was transformed with the DNA prepared in the above procedure. 
The procedure of the transformation was according to the protocol of COMPETENT HIGH prepared by 
5 TOYOBO CO.,LTD. At least 20 transformants transformed with the pUC18 cloning vector having the DNA 
fragment obtained by the method described in step (2-2) using the combination of the aforementioned 
primers were prepared. 

Plasmid DNA pUC1325 shown in Fig. 1 was prepared from the obtained transformant in the conven- 
tional method and the base sequence of the plasmid was determined by a 7-deaza sequence kit available 

70 from Takara Shuzo Co.. Ltd. or a fluorescence sequencer GENESIS 2000 system available from Du Pont. 
Two kinds of synthesized primers. 5'd(GTAAAACGACGGCCAGT)3* (SEQUENCE ID No. 15) and 5'd- 
(CAGGAAACAGCTATGAC) 3* (SEQUENCE ID No. 16) were used to determine a base sequence of the + 
strand and that of the - strand of the DNA fragment. The DNA fragment had the same base sequence as 
that shown in SEQUENCE ID No. 1 of SEQUENCE LISTING. The amino acid sequence shown in 

75 SEQUENCE ID No. 1 of SEQUENCE LISTING is encoded by the + strand of the gene derived from HCV 
and inserted In the plasmid of the transformant. 

The amino acid sequence encoded by the DNA fragment obtained was compared with the reported 
sequences of hepatitis C viruses. In step (2-2) of Example 1, three clones were obtained from the serum of 
one patient. The determination of the base sequence of the clones reveals that the patient carries several 

20 kinds of viruses. 

(4) Preparation of a plasmid expressing E2/NS1 protein 

Rgs. 1-6 show a procedure of preparing a plasmid expressing E2/NS1 protein. 

25 

(4-1) Preparation of DNA fragment 1325SK 

The DNA fragment of clone 1325 contained in plasmid pUC1325 obtained in step (3) was inserted at 
Sma I site of pUC18 so that the fragment had Kpnl site of pUC18 at the 5' end of the + strand of clone 

30 1 325 coding for E2/NS1 protein and Bam HI site of pUC18 at the 3' end. After complete digestion with 
restriction enzyme HIn dill, the fragment was partially digested with restriction enzyme Bam HI to obtain a 
DNA fragment which was cleaved not at Bam HI site within the vector but only at another Bam HI site 
present in clone 1325. The DNA fragment contains from the Bam HI site present at the 5' end to the 3* end 
of clone 1325 which was the DNA fragment obtained in step (2-2), which was derived from the gene of 

35 HCV. 

Subsequently, as shown in Fig. 1, the DNA fragment was treated with T4 DNA polymerase to make 
blunt ends. After being ligated with Spel linker consisting of the sequence of 5' pGGACTAGTCC 3' 
(SEQUENCE ID No. 17) (available from~New England Biolab Co.), the fragment was cleaved with restriction 
enzyme Xba I (the Xba I site of the fragment was derived from plasmid pUC18). The following adaptor was 
40 ligated to Xba I site at the 3* end to obtain DNA fragment 1 325SK. 



5' pCTAGAGAATTCGGTAC 3' (SEQUENCE ID No. 18) 
3 ' TCTTAAGCp 5 ' 



(4-2) Construction of plasmid pSRNot 

50 

Expression vector pAC316 reported in Journal of Virology, 65, 3015-3021, (1991) was cleaved with 
restriction enzyme Tth 1111 at Tthllll site present at the 3' end "of 3' poly A region. T4 DNA polymerase 
was acted on the cleaved vectoTTo make blunt ends. The fragment between Sail sit and Eco Rl site of 
plasmid pmoRH (Rg. 2) reported by Ikeda et al (Gene, 71^, 19-27, (1988)) was cut out and T4 DNA 
55 polymerase was acted on the fragment to mak blunt ends. 

As shown In Fig. 2, the DNA fragment derived from pAC316 and the DNA fragm nt derived from 
pmoRH were ligated together with Bgl II link r (available from Takara Shuzo Co.. Ltd.) to obtain plasmid 
pSR3l6EP containing one Bglll linker and one DNA fragment containing the early promoter of SV40 derived 
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from pmoRH. As shown in Rg. 3, after plasmid pSR316EP was cleaved with restriction nzymes Hgi Al 
and Dra III. T4 DNA polymerase was acted on th plasmid to make blunt ends. Then, one Not I linker was 
introduced in the plasmid to obtain plasmid psRNot (Fig. 3). Namely, NotI linker was~~pr pared by 
synthesizing DNA having a sequence of 5* AGCGGCCGC 3' and phosphorylating the 5* end by kination 
" 5 (Molecular Cloning second eddition. 11.31-11.44. (1989), Cold Spring Harbor Labratory Press). 

Subsequently, dhfr gene was cut out from plasmid pCHD2L reported by Ikeda et al in Gene. 71. 19-27, 
(1988) using restriction enzymes Kpn I and Eco RV and Kpn I- EcoRV fragment of plasmid Charomid9-36 
described in Proceedings of the National Academy of Sciences of the USA, 83. 8664-8668. (1986) was 
Inserted in the deleted dhfr gene region instead of the Kpnl- EcoRV fragment coding for dhfr gene as 
10 shown in Fig. 5 to obtain plasmid pChmBpl. The plasmid contains a polylinker derived from plasmid 
Charomid9-36. 

Then, plasmid pAGGO reported by Garapin et al. In Journal of Molecular Biology. 150 , 1-14, (1981) 
was cleaved with restriction enzyme Pvu II to obtain a Pvu II fragment coding for a neomycin gene. After 
plasmid pChmBpl was cleaved with restriction enzyme Eco RV and then T4 DNA polymerase was acted to 
75 make blunt ends, the fragment obtained was ligated to the Pvu II fragment to obtain plasmid pHLpI which 
contained the neomycin gene derived from plasmid pAG60 at the Eco RV site of plasmid pChmBpl (Rg. 
5). 

(4-3) Construction of expression vector paSRI 325X-3 

20 

As shown in Fig. 4, after plasmid pSRNot obtained in step (4-2) was cleaved with restriction enzyme 
Not I and then with T4 DNA polymerase to make blunt ends, this was cleaved with restriction enzyme 
Kpn I. The obtained DNA fragment was ligated to DNA fragment 1325SK obtained In step (4-1) to obtain 
expression vector paSR1325X-3 having only one DNA fragment 1325SK (Fig. 4). 

25 

(4-4) Construction of expression vector pHL16SR1325 

As shown in Fig. 6, expression vector paSR1325X-3 obtained in step (4-3) was cleaved with restriction 
enzyme Sfi I to prepare two fragments one of which was an expression unit of clone 1325. The Sfi I sites 

30 were present in an Initial promoter of SV40. Five u g of the Sfi I fragment having the expressin unit of 
clone 1325 was ligated to 50 ng of the fragment obtained by cleaving expression vector pHLpI with 
restriction enzyme Sfi I in 10 u I of a reaction solution using a ligation kit available from Takara Shuzo Co., 
Ltd. according to a protocol for the ligation kit to obtain expression vector pHL16SR1325 (Fig. 6). 

The vector had successive sixteen DNA fragments 1325SK having at the Sfi I site of expression vector 

35 paSR1325X-3 the expression unit of clone 1325 which was a gene coding for E2/NS1 protein. In the vector, 
all of the DNA fragments 1325SK were inserted downstream of SV40 promoter of expression vector 
paSR1325X-3 in a direction of transcription. 

(5) Obtaining a cell line constantly expressing E2/NS1 protein 

40 

Expression vector pHL16SR1325 prepared in step (4) was recovered from the recombinant E.coli DH1 
strain, purified according to the conventional technique described in Molecular Cloning second edition. 
1989, Cold Spring Harbor Laboratory Press to obtain a large amount of the expression plasmid DNA. CHO 
cells were transfected with the plasmid DNA according to the method described In Ausubel et al. (Current 
45 Protocols In Molecular Biology. Greene Publishing Associates and Wlley-lnterscience, Capter 9.1.1-9.1.4. 
(1987)) as follows: 

CHO cells were cultured in Ham F-12 medium containing 10 % of fetal calf serum (FCS) in a plate 
having a diamer of 6 cm until the cells were In semiconfluent condition. Then, the medium was removed 
from the plate and a DNA solution was dropwise added thereto. The DNA solution was previously prepared 

50 by the following procedure. 

Three hundreds u, 1 of 2xHEBS solution (2xHEBS solution ; 1.6 % sodium chloride, 0.074 % potassium 
chloride, 0.05 % Na2HP04 * 12H2O, 0.2 % dextrose and 1 % HEPES (pH 7.05)) were mixed with 10 ti g of 
the plasmid DNA in each plate and sterlliz d wat r was added to the mixture to prepare a solution of 570 u 
I. The solution was charged In an Eppendorf centrifuge tube. The DNA solution was violently agitated by a 

55 Vortex mixer for 1-2 seconds while adding 30 u I of 2.5 M calcium chlorid solution th reto. The DNA 
solution was agitated by a Vortex mixer at about 10-mlnute intervals during being left to stand for 30 
minutes. Th obtained DNA solution was add d to the afor m ntioned CHO c lis and the CHO cells were 
left to stand at room temperature for 30 minutes. Then. 5 ml of Ham F-12 medium containing 10 % of FCS 
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available from GIBCO Co. were add d to the plate and the cultur was incubated at 37 *C under air 
containing 5 % carbon dioxide for 4-5 hours. Subsequently, the medium was removed from the plate and 
the cells wer washed with 5 ml of a 1xTBS + + solution (1xTBS + + solution ; 25 mM Tris-HCI (pH 7.5). 
140 mM sodium chloride, 5mM potassium chloride, 0.6 mM disodium hydrogen phosphate, 0.08 mM 

5 calcium chloride and 0.08 mM magnesium chloride). After the 1xTBS + + solution was removed, 5 ml of a 
1xTBS + + solution containing 20 % of glycerol was added to the cells and the culture was left to stand at 
room temperature for 1-2 minutes. After the supernatant was removed from the plate, the cells were washed 
again with 5 ml of a IxTBS + + solution and cultured in 5 ml of fresh Ham F-12 medium containing 10 % 
of FCS in the plate at 37 • C under air containing 5 % carbon dioxide for 48 hours. Then, the medium was 

70 removed and the cells were washed with 5 ml of a IxTBS + + solution. The cells were treated with a 
trypsin-EDTA solution (available from Sigma Co.) and left to stand at room temperature for 30 seconds. Five 
minutes after the trypsin-EDTA solution was removed, the cells attached to the wall of the plate were peeled 
adding 5 ml of Ham F-12 medium containing 10 % of FCS. The cells cultured in one plate having a 
diameter of 5 cm were divided in ten plates having a diameter of 9 cm and cultured in the plates containing 

75 drug G418 (G418 sulfate (GENETICIN) available from GIBCO Co.) in a concentration of 600 u g /ml. Ten 
days after the cultivation, grown cells having G418 resistance were isolated and cultured for about 7 days in 
1 ml of Ham F-12 medium containing 10 % of FCS in a 24 well titer plate each well of which has an area of 
about 3.1 cm2. 

A part of the cells were clutured on slide glass (Lab-Tek Chamber Slides, Nunc4808 available from 
20 Japan Inter Med Co.) overnight. After being rinsed with phosphate buffered saline (PBS), the slide glass was 
immersed in cold actone-methanol (1:1) solution and maintained at -20 'C for 15 minutes to fix the cells. 
The cells fixed on the slide glass were reacted with the serum of the patient of hepatitis C 20-fold diluted 
with PBS at 37* C for 30 minutes. Then, the slide glass was washed three times with PBS for 5 minutes and 
reacted with FITC-labelled rabbit anti-human IgG (available from Daco Japan Co.) 50-fold diluted with PBS 
25 at 37 'C for 30 minutes. The slide glass was washed three times with PBS for 5 minutes and dried by 
putting the slide glass between two pieces of filter paper. After the slide glass was sealed with glycerin, the 
cells on the slide glass were observed under a fluorescence microscope. Screening positive cells as 
descrived above, successive three times of limiting dilution were carried out to establish cell line 13L20 
constantly producing E2/NS1 protein. 

30 

(6) Study of the reactivity of 13L20 cells with the serum of the patient of hepatitis C 

After 13L20 cells established in step (5) were cultured on Lab-Tek Chamber Slides (Lab-Tek Chamber 
Slides, Nunc4808 available from Japan Inter Med Co.) overnight and then fixed with a cold acetone- 
35 methanol solution, the fixed cells were reacted with 59 serum samples of the patients of hepatitis C. Then, 
the cells were washed as described above and reacted with the secondary antibody. The observation under 
a fluorescence microscope revealed that 53 samples were positive. Among the 59 serum samples. 6 
samples were judged to be positive using CHO cells constantly producing the first envelope region of HCV. 

40 Example 2 

Using as a template the DNA fragment described in Example 1 1 (3) of the specification of European 
Patent Application No. 92109812.5 filed on June 11, 1992 (TITLE OF THE INVENTION "Gene or DNA 
fragments derived from hepatitis C virus, polypeptides encoded thereby, and method of producing 

45 thereof"), PCR reaction was carried out in the same manner as that of Example 1 using the same primer to 
obtain a DNA fragment corresponding the same region as that of clone J1-1325 shown in SEQUENCE ID 
No. 1 of SEQUENCE LISTING. The region was a DNA fragment encoding for E2/NS1 protein like clone J1- 
1325. For example, using as a template the DNA fragment clone N27MX24A-1 having a base sequence 
shown in SEQUENCE ID No. 31 of SEQUENCE LISTING described in the specification of the aforemen- 

50 tioned European Patent Application filed on June 11, 1992, plasmid pUCN27MX24A-2 was obtained. The 
base sequence of the DNA fragment coding for E2/NS1 protein, which was cloned in the plasmid is shown 
in SEQUENCE ID No. 2 of SEQUENCE LISTING. In addition, MK2724A2 cell line constantly producing 
E2/NS1 protein was stablised by the same procedur as that described in steps (4) and (5) of Example 1 . 
The reactivity of th same samples as Example 1 with the cell line was estimated by the same method as 

55 that described in step (6) of Example 1. Results similar to those obtained in step (6) of Example 1 were 
obtained. 
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SEQUENCE LISTING 
(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(iv) ANTI -SENSE: No 

20 



(vi) ORIGINAL SOURCE: 
25 (A) ORIGIN: Hepatitis C virus 

(B) CLONE: Jl-1325 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



G ATC CCA CAA GCT GTC ATG GAC ATG GTG GCG GGG GCC CAC TGG GGA GTC 49 
He Pro Gin Ala Val Met Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

CTA GCG GGC CTT GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT 97 
Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

TTG ATT GTG ATG CTA CTC TTT GCC GGC GTT GAC GGG CAT ACC CGC GTG 145 
Leu He Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 

35 40 45 

ACG GGG GGG GTG CAA GGC CAT GTC ACC TCT ACA CTC ACG TCC CTC TTT 193 
Thr Gly Gly Val Gin Gly His Val Thr Ser Thr Leu Thr Ser Leu Phe 
50 55 60 
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15 



20 



25 



30 



35 



40 



45 



AGA CCT GGG GCG 
Arg Pro Gly Ala 
65 

TGG CAT ATC AAC 
Trp His lie Asn 



GGG TTT 
Gly Phe 

TGC CCG 
Cys Pro 

GGA TGG 
Gly Trp 
130 
CCG TAT 
Pro Tyr 
145 

TCG CAG 
Ser Gin 



CTT GCC 
Leu Ala 
100 
GAG CGC 
Glu Arg 
115 

GGT CCC 
Gly Pro 

TGC TGG 
Cys Trp 

GTG TGC 
Val Cys 



50 



GTG GGG ACG ACC 
Val Gly Thr Thr 
180 

AAT GAG ACG GAC 
Asn Glu Thr Asp 
195 

AAC TGG TTC GGC 
Asn Trp Phe Gly 



TCC CAG AAA 
Ser Gin Lys 
70 

AGG ACT GCC' 
Arg Thr Ala 
85 

GCG CTG TTC 
Ala Leu Phe 

ATG GCC AGC 
Met Ala Ser 

ATC ACC TAT 
lie Thr Tyr 
135 

CAC TAC GCA 
His Tyr Ala 

150 
GGT CCA GTG 
Gly Pro Val 
165 

GAT CGT TTC 
Asp Arg Phe 

GTG CTG CTC 
Val Leu Leu 

TGT ACA TGG 
Cys Thr Trp 



ATT CAG 
He Gin 

CTG AAC 
Leu Asn 

TAC ACA 
Tyr Thr 
105 
TGT CGC 
Cys Arg 
120 

GCT CAA 
Ala Gin 

CCT CGA 
Pro Arg 

TAT TGC 
Tyr Cys 

GGC GCC 
Gly Ala 
185 
CTA AAC 
Leu Asn 
200 

ATG AAT 
Met Asn 



CTT GTA 
Leu Val 
75 

TGC AAT 
Cys Asn 
90 

CAC AAG 
His Lys 

TCC ATT 
Ser He 

CCT GAC 
Pro Asp 

CAG TGT 
Gin Cys 
155 
TTC ACC 
Phe Thr 
170 

CCT ACG 
Pro Thr 

AAC ACG 
Asn Thr 

AGC ACT 
Ser Thr 



AAC ACC 
Asn Thr 

GAC TCC 
Asp Ser 

TTC AAC 
Phe Asn 

GAC AAG 
Asp Lys 
125 
AAC TCG 
Asn Ser 
140 

GGT ATC 
Gly He 

CCA AGC 
Pro Ser 

TAT AAC 
Tyr Asn 

CGG CCG 
Arg Pro 
205 
GGG TTC 
Gly Phe 



AAT GGC AGT 
Asn Gly Ser 
80 

CTC AAA ACT 
Leu Lys Thr 
95 

GCG TCC GGA 
Ala Ser Gly 
110 

TTC GAC CAG 
Phe Asp Gin 

GAC CAG AGG 
Asp Gin Arg 

GTA CCC GCG 
Val Pro Ala 
160 

CCT GTT GTA 
Pro Val Val 

175 
TGG GGG GAC 
Trp Gly Asp 
190 

CCG CAT GGC 
Pro His Gly 

ACC AAG ACG 
Thr Lys Thr 



241 



289 



337 



385 



433 



481 



529 



577 



625 



673 



55 



11 



EP 0 537 626 A1 



10 



15 



210 215 220 

TGC GGA GGC CCC CCG TGT AAC ATC AGG GGG GTC GGC AAC AAC ACC TTG 721 
Cys Gly Gly Pro Pro Cys Asn lie Arg Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

ACC TGC CCC ACG GAC TGC TTC CGG AAG CAC CCC GAC GCC ACT TAC ACA 769 
Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Thr 

245 250 255 

AAA TGT GGT TCG GGC CCT TGG TTG ACA CCT AGG TGC TTG GTT GAC TAC 817 
Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Val Asp Tyr 
260 265 270 

20 CCA TAC AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTT ACC ATC TTC 865 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 
275 280 285 

25 AAG GTT AGG ATG TAT GTG GGG GGC GTG GAG CAC AGG CTT GAT GCT GCA 913 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Asp Ala Ala 

290 295 300 

TGC AAC TGG ACT CGA GGA GAG CGT TGC GAC TTG GAG GAC AGG GAT AGA 961 
Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

GCA GAG CTC AGC CCG CTA CTG CTG TCT ACG ACA GAG TGG CAG GTA CTG 1009 
Ala Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Val Leu 

325 330 335 

CCC TGT TCC TTC ACC ACC CTA CCG GCT CTG TCC ACT GGT CTA ATC CAT 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

CTC CAT CAG AAC GTC GTG GAC GTG CAA TAC CTG TAC GGT ATA GGG TCA 1105 
Leu His Gin Asn Val Val Asp Val Gin Tyr Leu Tyr Gly He Gly Ser 
50 355 360 365 

GCA GTT GTC TCC TTT GTA ATC AAA TGG GAG TAT GTC CTG TTG CTT TTC 1153 
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Ala Val Val Ser Phe Val lie Lys Trp Glu Tyr Val Leu Leu Leu Phe 

370 375 380 

CTT CTC CTG GCT GAC GCA CGC GTC TGT GCC TGC TTG TGG ATG ATG CTG 
Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 
385 390 395 400 

CTG ATA 
Leu lie 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(iv) ANTI -SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: N27MX24A-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

G ATC CCA CAA GCC GTG GTG GAT ATG GTG GCA GGG GCC CAC TGG GGA GTC 49 
lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC CTT GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTC 97 
Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

TTG GTT GTG ATG CTG CTC TTC GCC GGT GTT GAC GGG GGG ACC CAC GTG 145 
Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly Gly Thr His Val 

35 40 45 

ACA GGG GGG AAG GTA GCC TAC ACC ACC CAG GGC TTT ACA CCC TTC TTT 193 
Thr Gly Gly Lys Val Ala Tyr Thr Thr Gin Gly Phe Thr Pro Phe Phe 

50 55 60 

TCA CGA GGG CCG TCT CAG AAA ATC CAA CTT GTA AAC ACT AAC GGC AGC 241 
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Ser Arg Gly Pro Ser Gin Lys lie Gin Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

TGG CAC ATC AAT AGG ACT GCC CTC AAT TGC AAT GAC TCC CTT AAC ACC 289 
Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr 

85 90 95 

GGG TTC CTT GCC GCG CTG TTC TAG ACC CAC AGC TTC AAC GCG TCC GGA 337 
Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Ser Phe Asn Ala Ser Gly 

100 105 110 

TGT CCG GAG CGT ATG GCC GGT TGC CGC CCC ATT GAC GAG TTC GCT CAG 385 
Cys Pro Glu Arg Met Ala Gly Cys Arg Pro He Asp Glu Phe Ala Gin 

115 120 125 

GGG TGG GGT CCC ATC ACT CAT GTT GTG CCT AAC ATC TCG GAC CAG AGG 433 
Gly Trp Gly Pro He Thr His Val Val Pro Asn He Ser Asp Gin Arg 

130 135 140 

CCC TAT TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATC GTA CCC GCG 481 
Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

TCG CAG GTG TGT GGT CCG GTG TAT TGC TTC ACC CCA AGC CCT GTT GTG 529 
Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 

165 170 175 

GTG GGG ACG ACC GAT CGT TTC GGC GCC CCC ACG TAC AAC TGG GGA AAC 577 
Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Asn Trp Gly Asn 

180 185 190 

AAT GAG ACG GAT GTG CTA CTC CTC AAC AAC ACA CGG CCG CCG CAG GGC 625 
Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro Gin Gly 

195 200 205 

AAC TGG TTC GGT TGT ACC TGG ATG AAT GGC ACT GGG TTC ACA AAG ACG 673 
Asn Trp Phe Gly Cys Thr- Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 



55 



15 



10 



15 



20 



EP 0 537 626 A1 

TGC GGG GGC GCC CCG TGC AAC ATC GGG GGG GTC GGC AAC AAT ACC TTG 721 
Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

ACT TGC CCC ACG GAC TGC TTC CGG AAG CAC CCC GAG GCC ACT TAC ACA 769 
Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Thr 

245 250 255 

AAA TGT GGT TCG GGG CCT TGG TTG ACG CCT AGG TGC CTA GTT CAT TAC 817 
Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Val His Tyr 

260 265 270 

CCA TAC AGG CTC TGG CAC TAT CCC TGC ACT GTC AAC TTT ACC ATC TTC 865 
Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe 

275 280 285 

AAG GTT AGG ATG TAT GTG GGG GGC GTG GAA CAC AGG CTT GAA GCT GCA 913 
25 Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala 

290 295 300 

TGC AAT TGG ACC CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT AGA 961 
30 Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 

305 310 315 320 

TCA GAG CTT AGC CCG CTA TTG CTG TCC ACA ACA GAG TGG CAG GTA CTG 1009 
Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Val Leu 

325 330 335 

CCC TGT TCC TTC ACC ACC CTG CCG GCT CTG TCC ACT GGT TTG ATT CAT 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu lie His 

340 345 350 

CTC CAT CAG AAC ATC GTG GAC GTG CAA TAT CTG TAC GGC ATA GGG TCG 1105 
Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly lie Gly Ser 

355 360 365 

GCG GTT GTC TCC TTC GCA ATC AAA TGG GAA TAT ATT CTG TTG CTT TTC 1153 
Ala Val Val Ser Phe Ala He Lys Trp Glu Tyr He Leu Leu Leu Phe 
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370 375 380 

CTC CTC CTG GCG GAG GOG CGC GTC TGT GCC TGC TTG TGG ATG ATG CTG 1201 
Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 
385 390 395 400 

CTG ATA 1207 
Leu lie 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: N27 ,N19 ,H19, Y19,MX24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 

15 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly Gly Thr His Val 

Arg 

35 40 45 

Thr Gly Gly Lys Val Ala Tyr Thr Thr Gin Gly Phe Thr Pro Phe Phe 

Arg Ser 
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Ser 

50 55 60 

Ser Arg Gly Pro Ser Gin Lys lie Gin Leu Val Asn Thr Asn Gly Ser 

Arg 

65 70 75 80 

Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr 

Gin 

85 90 95 

Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Ser Phe Asn Ala Ser Gly 
Thr Arg Asp 

100 105 110 

Cys Pro Glu Arg Met Ala Gly Cys Arg Pro He Asp Glu Phe Ala Gin 

Ser Ser 
115 120 125 

Gly Trp Gly Pro He Thr His Val Val Pro Asn He Ser Asp Gin Arg 
Asp Asp Val 

130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 

Val 

145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
Trp 

165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr Asn Trp Gly Asn 

Ser Thr Ala 

180 185 190 

Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro Gin Gly 

195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
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210 215 220 

Cys Gly Gly Pro Pro Cys Asn He Gly Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Thr 

245 250 255 

Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Val His Tyr 

260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 

275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala 

290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Val Leu 

325 330 335 

Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly He Gly Ser 

355 360 365 

Ala Val Val Ser Phe Ala He Lys Trp Glu Tyr He Leu Leu Leu Phe 

370 375 380 

Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 
385 390 395 400 

Leu He 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(iv) ANTI -SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: BK164 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



G ATC CCA CAA GCC GTC GTG GAC ATG GTG GCG GGG GCC CAC TGG GGA GTC 49 
lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC CTT GCC TAC TAT TCC ATG GCG GGG AAC TGG GCT AAG GTT 97 
30 Leu Ala Gly Leu Ala Tyr Tyr Ser Met Ala Gly Asn Trp Ala Lys Val 

20 25 30 

CTG ATT GTG ATG CTA CTT TTT GCT GGC GTT GAC GGG GAT ACC CAC GTG 145 
Leu He Val Met Leu Leu Phe Ala Gly Val Asp Gly Asp Thr His Val 

35 40 45 

ACA GGG GGG GCG CAA GCC AAA ACC ACC AAC AGG CTC GTG TCC ATG TTC 193 
Thr Gly Gly Ala Gin Ala Lys Thr Thr Asn Arg Leu Val Ser Met Phe 

50 55 60 

GCA AGT GGG CCG TCT CAG AAA ATC CAG CTT ATA AAC ACC AAT GGG AGT 241 
Ala Ser Gly Pro Ser Gin Lys He Gin Leu He Asn Thr Asn Gly Ser 
65 70 75 80 

TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAT GAC TCT CTC CAG ACT 289 
Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
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50 



85 90 95 

GGG TTT CTT GCC GCG CTG TTC TAG AC A CAT AGT TTC AAC TCG TCC GGG 337 
Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Ser Phe Asn Ser Ser Gly 

100 105 110 

TGC CCA GAG CGC ATG GCC CAG TGC CGC ACC ATT GAC AAG TTC GAC CAG 385 
Cys Pro Glu Arg Met Ala Gin Cys Arg Thr He Asp Lys Phe Asp Gin 

115 120 125 

GGA TGG GGT CCC ATT ACT TAT GCT GAG TCT AGC AGA TCA GAC CAG AGG 433 
Gly Trp Gly Pro He Thr Tyr Ala Glu Ser Ser Arg Ser Asp Gin Arg 

130 135 140 

CCA TAT TGC TGG CAC TAC CCA CCT CCA CAA TGT ACC ATC GTA CCT GCG 481 
Pro Tyr Cys Trp His Tyr Pro Pro Pro Gin Cys Thr He Val Pro Ala 
145 150 155 160 

TCG GAG GTG TGC GGC CCA GTG TAC TGC TTC ACC CCA AGC CCT GTC GTC 529 
Ser Glu Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 

165 170 175 

GTG GGG ACG ACC GAT CGT TTC GGT GTC CCT ACG TAT AGA TGG GGG GAG 577 
Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Arg Trp Gly Glu 

180 185 190 

AAC GAG ACT GAC GTG CTG CTG CTC AAC AAC ACG CGG CCG CCG CAA GGC 625 
Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro Pro Gin Gly 

195 200 205 

AAC TGG TTC GGC TGC ACA TGG ATG AAT AGC ACC GGG TTC ACC AAG AC A 673 
Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 

210 215 220 

TGT GGG GGG CCC CCC TGT AAC ATC GGG GGG GTC GGC AAC AAC ACC CTG 721 
Cys Gly Gly Pro Pro Cys Asn He Gly Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

ACC TGC CCC ACG GAC TGC TTC CGG AAG CAC CCC GAG GCT ACC TAC ACA 769 
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Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Thr 

245 250 255 

AAA TGT GGT TCG GGG CCT TGG CTG ACA CCT AGG TGC ATG GTT GAC TAT 817 
Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val Asp Tyr 

260 265 270 

CCA TAC AGG CTC TGG CAT TAC CCC TGC ACT GTT AAC TTT ACC ATC TTC 865 
Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 

275 280 285 

AAG GTT AGG ATG TAT GTG GGG GGG GTG GAG CAC AGG CTC AAT GCT GCA 913 
Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Asn Ala Ala 

290 295 300 

TGC AAT TGG ACC CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT AGG 961 
Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
25 305 310 315 320 

CCG GAG CTC AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG GTA CTG 1009 
Pro Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Val Leu 

325 330 335 

CCC TGT TCC TTC ACC ACC CTA CCA GCT CTG TCC ACT GGC TTG ATT CAC 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTA TAC GGT ATA GGG TCA 1105 
Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly lie Gly Ser 

355 360 365 

GCG GTT GTC TCC TTT GCA ATC AAA TGG GAG TAT GTC CTG TTG CTT TTC 1153 
Ala Val Val Ser Phe Ala He Lys Trp Glu Tyr Val Leu Leu Leu Phe 

370 375 380 

CTT CTC CTA GCG GAC GCA CGT GTC TGT GCC TGC TTG TGG ATG ATG CTG 1201 
50 Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 

385 390 395 400 
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CTG ATA 1207 
Leu lie 

6 

(2) INFORMATION FOR SEQ ID NO: 5: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

75 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(iv) ANTI -SENSE: No 



25 (vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: HCV-J 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



G ATC CCA CAA GCC GTC GTG GAC ATG GTG GCG GGG GCC CAC TGG GGT GTC 49 
He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

CTA GCG GGC CTT GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTC 97 
Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

TTG ATT GTG ATG CTA CTC TTT GCT GGC GTT GAC GGG CAC ACG CAC GTG 145 
Leu He Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr His Val 
50 3 5 4 0 4 5 

ACA GGG GGA AGG GTA GCC TCC AGC ACC CAG AGC CTC GTG TCC TGG CTC 193 
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GGC CCA TCT 
Gly Pro Ser 



Thr Gly Gly Arg Val 
50 

TCA CAA 
Ser Gin 
65 

TGG CAC ATC AAC AGG 
Trp His lie Asn Arg 
85 

ATT GCT GCG 
He Ala Ala 

100 
GAG CGC ATG 
Glu Arg Met 
115 

GGT CCC ATC 
Gly Pro He 



GGG TTC 
Gly Phe 

TGC CCA 
Cys Pro 

GGG TGG 
Gly Trp 
130 
CCA TAT 
Pro Tyr 
145 

TCG CAG 
Ser Gin 

GTG GGG 
Val Gly 

AAT GAG 
Asn Glu 



50 



TGC TGG CAC 
Cys Trp His 

GTG TGT GGT 
Val Cys Gly 
165 

ACG ACC GAT 
Thr Thr Asp 

180 
ACA GAC GTG 
Thr Asp Val 
195 



Ala Ser 
55 

CAG AAA 
Gin Lys 

70 
ACC GCT 
Thr Ala 

CTG TTC 
Leu Phe 

GCT AGC 
Ala Ser 

ACT CAT 
Thr His 
135 
TAC GCG 
Tyr Ala 
150 

CCA GTG 
Pro Val 

CGT TTC 
Arg Phe 

CTG CTA 
Leu Leu 



Ser Thr Gin Ser Leu Val Ser Trp Leu 
60 
AAC ACC 
Asn Thr 



ATC CAA 
He Gin 



CTG AAT 
Leu Asn 

TAC GCA 
Tyr Ala 
105 
TGC CGC 
Cys Arg 
120 

GAT ATG 
Asp Met 

CCT CGA 
Pro Arg 

TAT TGC 
Tyr Cys 

GGC GCT 
Gly Ala 
185 
CTT AGC 
Leu Ser 
200 



CTC GTG 
Leu Val 
75 

TGC AAT 
Cys Asn 

90 
CAC AGG 
His Arg 

CCC ATC 
Pro He 

CCT GAG 
Pro Glu 

CCG TGC 
Pro Cys 
155 
TTC ACT 
Phe Thr 
170 

CCT ACG 
Pro Thr 

AAC ACG 
Asn Thr 



GAC TCC 
Asp Ser 

TTC AAC 
Phe Asn 

GAT GAG 
Asp Glu 
125 
AGC TCG 
Ser Ser 
140 

GGG ATC 
Gly He 

CCG AGC 
Pro Ser 

TAT AGC 
Tyr Ser 

CGG CCG 
Arg Pro 
205 



AAC GGC AGC 
Asn Gly Ser 
80 

CTC CAA ACT 
Leu Gin Thr 
95 

GCG TCC GGG 
Ala Ser Gly 
110 

TTC GCT CAG 
Phe Ala Gin 

GAC CAG AGG 
Asp Gin Arg 

GTG CCT GCG 
Val Pro Ala 
160 

CCT GTT GTA 
Pro Val Val 

175 
TGG GGG GAG 
Trp Gly Glu 
190 

CCT CAA GGC 
Pro Gin Gly 
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AAC TGG TTT GGG TGC ACG TGG ATG AAC AGC ACT GGG TTC ACC AAG ACG 673 
Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr 

210 215 220 

TGC GGG GGC CCT CCG TGC AAC ATC GGG GGG GTC GGC AAC AAC ACC TTG 721 
Cys Gly Gly Pro Pro Cys Asn He Gly Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

GTC TGC CCC ACG GAT TGC TTC CGG AAG CAC CCC GAG GCC ACT TAC ACA 769 
Val Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Thr 

245 250 255 

AAG TGT GGC TCG GGG CCC TGG TTG ACA CCC AGG TGC ATG GTT GAC TAC 817 
Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val Asp Tyr 

260 265 270 

CCA TAC AGG CTC TGG CAC TAC CCC TGC ACT GTT AAC TTT ACC GTC TTT 865 
Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr Val Phe 

275 280 285 

AAG GTC AGG ATG TAT GTG GGG GGC GTG GAG CAC AGG CTC AAT GCT GCA 913 
Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Asn Ala Ala 

290 295 300 

TGC AAT TGG ACT CGA GGA GAG CGC TGT GAC TTG GAG GAC AGG GAT AGG 961 
Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

TCA GAA CTC AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG ATA CTG 1009 
Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu 

325 330 335 

CCC TGT TCC TTC ACC ACC CTA CCG GCC CTG TCC ACT GGC TTC ATC CAT 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

CTT CAC CGG AAC ATC GTG GAC GTC CAA TAC CTC TAC GGT ATA GGG TCG 1105 
Leu His Arg Asn lie Val Asp Val Gin Tyr Leu Tyr Gly He Gly Ser 
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355 360 365 

GCA GTT GTC TCC TTT GCA ATC AAA TGG GAG TAT ATC CTG TTG CTT TTC 1153 
Ala Val Val Ser Phe Ala lie Lys Trp Glu Tyr He Leu Leu Leu Phe 

370 375 380 

CTT CTT CTG GCG GAC GCG CGC GTC TGT GCC TGC TTG TGG ATG ATC CTC 1201 
Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 
385 390 395 400 

CTG ATA 1207 
Leu He 



20 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(iv) ANTI -SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: HCV-RNA33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

G ATC CCG CAA GCT GTC GTG GAC ATC GTC GCG GGG GCC CAC TGG GGA GTC 49 
He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 
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CTG GCG 
Leu Ala 

TTG ATT 
Leu lie 

ACG GGG 
Thr Gly 
50 

TCC GCT 
Ser Ala 

65 
TGG CAC 
Trp His 

GGA TTT 
Gly Phe 

GCC GAG 
Ala Glu 

GGA TGG 
Gly Trp 
130 
CCT TAT 
Pro Tyr 
145 

TCG GAG 
Ser Glu 



GGC CTG 
Gly Leu 
20 

GTG ATG 
Val Met 

35 
GGG GCG 
Gly Ala 

GGG TCG 
Gly Ser 

ATC AAC 
lie Asn 

CTT GCC 
Leu Ala 
100 
AGC GTA 
Ser Val 
115 

GGC CCC 
Gly Pro 

TGC TGG 
Cys Trp 

GTG TGC 
Val Cys 



GCC TAC 
Ala Tyr 

CTA CTC 
Leu Leu 

GTT GCC 
Val Ala 

CAG GAG 
Gin Glu 
70 

AGG ACT 
Arg Thr 
85 

GCG CTG 
Ala Leu 

TTG GCC 
Leu Ala 

ATC ACT 
He Thr 

CAC TAT 
His Tyr 
150 
GGT CCA 
Gly Pro 



TAT TCC 
Tyr Ser 

TTT GCC 
Phe Ala 
40 

CGC ACC 
Arg Thr 

55 
AAC ATC 
Asn He 

GCC CTG 
Ala Leu 

TTC TAC 
Phe Tyr 

AGC TGC 
Ser Cys 
120 
TAC ACC 
Tyr Thr 
135 

CCA CCC 
Pro Pro 

GTG TAT 
Val Tyr 



ATG GTG GGG 
Met Val Gly 
25 

GGC GTT GAC 
Gly Val Asp 

ACC ACC GGG 
Thr Thr Gly 

CAG CTT ATA 
Gin Leu lie 
75 

AAC TGC AAC 
Asn Cys Asn 
90 

ACA CAC AAG 
Thr His Lys 
105 

CGC TTC ATC 
Arg Phe He 

GAG CGT AAC 
Glu Arg Asn 

CGA CAG TGT 
Arg Gin Cys 
155 

TGT TTC ACC 
Cys Phe Thr 



AAC TGG GCT 
Asn Trp Ala 
30 

GGG CAA ACC 
Gly Gin Thr 
45 

TTC GCG TCC 
Phe Ala Ser 
60 

AAC ACC AAT 
Asn Thr Asn 

GAC TCC CTC 
Asp Ser Leu 

TTC AAC TCA 
Phe Asn Ser 
110 

GAC GAG TTC 
Asp Glu Phe 

125 
AGT TCG GAC 
Ser Ser Asp 
140 

GGT ATC ATA 
Gly He He 

CCA AGC CCT 
Pro Ser Pro 



AAG GTT 
Lys Val 

TAT ACG 
Tyr Thr 

CTC TTC 
Leu Phe 

GGC AGC 
Gly Ser 
80 

AAC ACT 
Asn Thr 

95 
TCC AGA 
Ser Arg 

GAT CAG 
Asp Gin 

CAG AGG 
Gin Arg 

CCC GCG 
Pro Ala 
160 
GTT GTG 
Val Val 
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GTG GGG 
Val Gly 

AAT GAG 
Asn Glu 

AAC TGG 
Asn Trp 
210 
TGC GGG 
Cys Gly 
225 

ACT TGC 
Thr Cys 

AAA TGT 
Lys Cys 

CCA TAC 
Pro Tyr 

AAG GTT 
Lys Val 
290 
TGC AAT 
Cys Asn 
305 

TCA GAG 



ACA ACC 
Thr Thr 
180 
ACG GAC 
Thr Asp 
195 

TTC GGC 
Phe Gly 

GGT CCC 
Gly Pro 

CCC ACG 
Pro Thr 

GGT TCG 
Gly Ser 
260 
AGG CTC 
Arg Leu 
275 

AGG ATG 
Arg Met 



165 

GAT CGG 
Asp Arg 

GTG CTG 
Val Leu 

TGT ACA 
Cys Thr 

CCG TGT 
Pro Cys 
230 
GAC TGC 
Asp Cys 
245 

GGG CCT 
Gly Pro 

TGG CAC 
Trp His 

TAT GTG 
Tyr Val 



TTC GGT 
Phe Gly 

GTT CTC 
Val Leu 
200 
TGG ATG 
Trp Met 
215 

CAC ATC 
His lie 

TTC CGG 
Phe Arg 

TGG TTG 
Trp Leu 

TAC CCC 
Tyr Pro 
280 
GGG GGC 
Gly Gly 
295 

TGG ACT CGA GGA GAC CGT TGT AAC 
Trp Thr Arg Gly Asp Arg Cys Asn 
310 

CTT AGT CCG CTG CTG CTG TCT ACG 



170 
GTC CCT 
Val Pro 
185 

AAC AAC 
Asn Asn 

AAT GGC 
Asn Gly 

GGG GGG 
Gly Gly 

AAG CAT 
Lys His 
250 
ACA CCT 
Thr Pro 
265 

TGC ACT 
Cys Thr 

GTG GAG 
Val Glu 



ACA TAC 
Thr Tyr 

ACG CGG 
Thr Arg 

ACT GGT 
Thr Gly 
220 
CGC GGC 
Arg Gly 
235 

CCC GAG 
Pro Glu 

AGG TGC 
Arg Cys 

GTC AAC 
Val Asn 

CAC AGG 
His Arg 
300 
TTG GAG 
Leu Glu 
315 

ACA GAG 



AGC TGG 
Ser Trp 
190 
CCG CCG 
Pro Pro 
205 

TTC ACC 
Phe Thr 

AAC AAC 
Asn Asn 

GCT ACG 
Ala Thr 

ATG GTT 
Met Val 
270 
TTT ACC 
Phe Thr 
285 

CTC ATT 
Leu lie 



175 

GGG GAG 
Gly Glu 

CAG GGC 
Gin Gly 

AAG ACA 
Lys Thr 

ACC CTG 
Thr Leu 
240 
TAT ACA 
Tyr Thr 
255 

GAT TAC 
Asp Tyr 

ACC TTT 
Thr Phe 

GCT GCA 
Ala Ala 



577 



GAC AGG GAT AGA 
Asp Arg Asp Arg 
320 

TGG CAG ATA CTG 
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Set Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu 

325 330 335 

CCC TGT TCC TTC ACC ACC CTA CCG GCT CTC TCC ACC GGT TTG ATC CAT 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT ATA GGG TCT 1105 
Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly He Gly Ser 

355 360 365 

GCT GTT GTC TCC ATT GCA ATC AGG TGG GAA TAT GTC CTG TTG CTT TTC 1153 
Ala Val Val Ser He Ala He Arg Trp Glu Tyr Val Leu Leu Leu Phe 

370 375 380 

CTT CTC CTG GCG GAC GCG CGT GTC TGT GCC TGC TTG TGG ATG ATG CTG 1201 
Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 
385 390 395 400 

CTG ATA 1207 
Leu He 

(2) INFORMATION FOR SEQ ID NO: 7: 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



25 



30 



40 



45 



50 



(iv) ANTI-SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 
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(B) CLONE: HCVl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

G ATC CCA CAA GCC ATC TTG GAC ATG ATC GCT GGT OCT CAC TGG GGA GTC 49 
lie Pro Gin Ala He Leu Asp Met He Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC ATA GCG TAT TTC TCC ATG GTG GGG AAC TGG GCG AAG GTC 97 
Leu Ala Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

CTG GTA GTG CTG CTG CTA TTT GCC GGC GTC GAC GCG GAA ACC CAC GTC 145 
Leu Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr His Val 

35 40 45 

ACC GGG GGA AGT GCC GGC CAC ACT GTG TCT GGA TTT GTT AGC CTC CTC 193 
Thr Gly Gly Ser Ala Gly His Thr Val Ser Gly Phe Val Ser Leu Leu 
50 55 60 

30 GCA CCA GGC GCC AAG CAG AAC GTC CAG CTG ATC AAC ACC AAC GGC AGT 241 

Ala Pro Gly Ala Lys Gin Asn Val Gin Leu He Asn Thr Asn Gly Ser 
65 70 75 80 

35 TGG CAC CTC AAT AGC ACG GCC CTG AAC TGC AAT GAT AGC CTC AAC ACC 289 

Trp His Leu Asn Ser Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr 

85 90 95 

GGC TGG TTG GCA GGG CTT TTC TAT CAC CAC AAG TTC AAC TCT TCA GGC 337 
Gly Trp Leu Ala Gly Leu Phe Tyr His His Lys Phe Asn Ser Ser Gly 

100 105 110 

TGT CCT GAG AGG CTA GCC AGC TGC CGA CCC CTT ACC GAT TTT GAC CAG 385 
Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp Gin 

115 120 125 

GGC TGG GGC CCT ATC AGT TAT GCC AAC GGA AGC GGC CCC GAC CAG CGC 433 
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Gly Trp Gly Pro lie Ser Tyr Ala Asn Gly Ser Gly Pro Asp Gin Arg 

130 135 140 

CCC TAG TGC TGG CAC TAG CCC CCA AAA CCT TGC GGT ATT GTG CCC GCG 481 
Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly He Val Pro Ala 
145 150 155 160 

AAG AGT GTG TGT GGT CCG GTA TAT TGC TTC ACT CCC AGC CCC GTG GTG 529 
Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 

165 170 175 

GTG GGA ACG ACC GAC AGG TCG GGC GCG CCC ACC TAC AGC TGG GGT GAA 577 
Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser Trp Gly Glu 

180 185 190 

AAT GAT ACG GAC GTC TTC GTC CTT AAC AAT ACC AGG CCA CCG CTG GGC 625 
Asn Asp Thr Asp Val Phe Val Leu Asn Asn Thr Arg Pro Pro Leu Gly 

195 200 205 

AAT TGG TTC GGT TGT ACC TGG ATG AAC TCA ACT GGA TTC ACC AAA GTG 673 
Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Val 

210 215 220 

TGC GGA GCG CCT CCT TGT GTC ATC GGA GGG GCG GGC AAC AAC ACC CTG 721 
Cys Gly Ala Pro Pro Cys Val He Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

CAC TGC CCC ACT GAT TGC TTC CGC AAG CAT CCG GAC GCC ACA TAC TCT 769 
His Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser 

245 250 255 

CGG TGC GGC TCC GGT CCC TGG ATC ACA CCC AGG TGC CTG GTC GAC TAC 817 
Arg Cys Gly Ser Gly Pro Trp He Thr Pro Arg Cys Leu Val Asp Tyr 

260 265 270 

CCG TAT AGG CTT TGG CAT TAT CCT TGT ACC ATC AAC TAC ACC ATA TTT 865 
Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He Asn Tyr Thr He Phe 
275 280 285 
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AAA ATC AGG ATG TAC GTG GGA GGG GTC GAA CAC AGG CTG GAA GCT GCC 913 
Lys lie Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala 
*5 290 295 300 

TGC AAC TGG ACG CGG GGC GAA CGT TGC GAT CTG GAA GAG AGG GAC AGG 961 
Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

TCC GAG CTC AGC CCG TTA CTG CTG ACC ACT ACA CAG TGG CAG GTC CTC 1009 
Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr Gin Trp Gin Val Leu 

325 330 335 

CCG TGT TCC TTC ACA ACC CTA CCA GCC TTG TCC ACC GGC CTC ATC CAC 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

CTC CAC CAG AAC ATT GTG GAC GTG CAG TAC TTG TAC GGG GTG GGG TCA 1105 
Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser 

355 360 365 

AGC ATC GCG TCC TGG GCC ATT AAG TGG GAG TAC GTC GTT CTC CTG TTC 1153 
30 Ser lie Ala Ser Trp Ala He Lys Trp Glu Tyr Val Val Leu Leu Phe 

370 375 380 

CTT CTG CTT GCA GAC GCG CGC GTC TGC TCC TGC TTG TGG ATG ATG CTA 1201 
Leu Leu Leu Ala Asp Ala Arg Val Cys Ser Cys Leu Trp Met Met Leu 
385 390 395 400 

CTC ATA 1207 
Leu He 
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(2) INFORMATION FOR SEQ ID NO: 8: 

♦ 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 
^0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

75 

(iv) ANTI -SENSE: No 

20 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 
^5 (B) CLONE: H77 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

G ATC CCA CAA GCC ATC ATG GAC ATG ATC GCT GGT GCT CAC TGG GGA GTC 49 
He Pro Gin Ala He Met Asp Met He Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC ATA GCG TAT TTC TCC ATG GTG GGG AAC TGG GCG AAG GTC 97 
Leu Ala Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

CTG GTA GTG CTG CTG CTA TTT GCC GGC GTC GAC GCG GAA ACC CAC GTC 145 
Leu Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr His Val 

35 40 45 

ACC GGG GGA AGT GCC GGC CGC ACC ACG GCT GGG CTT GTT GGT CTC CTT 193 
Thr Gly Gly Ser Ala Gly Arg Thr Thr Ala Gly Leu Val Gly Leu Leu 

50 55 60 

ACA CCA GGC GCC AAG CAG AAC ATC CAA CTG ATC AAC ACC AAC GGC AGT 241 
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Thr Pro Gly Ala Lys Gin Asn He Gin Leu He Asn Thr Asn Gly Ser 
65 70 75 80 

TGG CAC ATC AAT AGC ACG GCC TTG AAC TGC AAT GAA AGC CTT AAC ACC 289 
Trp His He Asn Ser Thr Ala Leu Asn Cys Asn Glu Ser Leu Asn Thr 

85 90 95 

GGC TGG TTA GCA GGG CTC TTC TAT CAC CAC AAA TTC AAC TCT TCA GGC 337 
Gly Trp Leu Ala Gly Leu Phe Tyr His His Lys Phe Asn Ser Ser Gly 

100 105 110 

TGT CCT GAG AGG TTG GCC AGC TGC CGA CGC CTT ACC GAT TTT GCC CAG 385 
Cys Pro Glu Arg Leu Ala Ser Cys Arg Arg Leu Thr Asp Phe Ala Gin 

115 120 125 

GGC TGG GGT CCT ATC AGT TAT GCC AAC GGA AGC GGC CTC GAC GAA CGC 433 
Gly Trp Gly Pro He Ser Tyr Ala Asn Gly Ser Gly Leu Asp Glu Arg 

130 135 140 

CCC TAC TGC TGG CAC TAC CCT CCA AGA CCT TGT GGC ATT GTG CCC GCA 481 
Pro Tyr Cys Trp His Tyr Pro Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

AAG AGC GTG TGT GGC CCG GTA TAT TGC TTC ACT CCC AGC CCC GTG GTG 529 
Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 

165 170 175 

GTG GGA ACG ACC GAC AGG TCG GGC GCG CCT ACC TAC AGC TGG GGT GCA 577 
Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser Trp Gly Ala 

180 185 190 

AAT GAT ACG GAT GTC TTC GTC CTT AAC AAC ACC AGG CCA CCG CTG GGC 625 
Asn Asp Thr Asp Val Phe Val Leu Asn Asn Thr Arg Pro Pro Leu Gly 

195 200 205 

AAT TGG TTC GGT TGT ACC TGG ATG AAC TCA ACT GGA TTC ACC AAA GTG 673 
Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Val 
210 215 220 
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TGC GGA GCG CCC CCT TGT GTC ATC GGA GGG GTG GGC AAC AAC ACC TTG 721 
Cys Gly Ala Pro Pro Cys Val lie Gly Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

CTC TGC CCC ACT GAT TGC TTC CGC AAG CAT CCG GAA GCC ACA TAC TCT 769 
Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ser 

245 250 255 

CGG TGC GGC TCC GGT CCC TGG ATT ACA CCC AGG TGC ATG GTC GAC TAC 817 
Arg Cys Gly Ser Gly Pro Trp lie Thr Pro Arg Cys Met Val Asp Tyr 

260 265 270 

CCG TAT AGG CTT TGG CAC TAT CCT TGT ACC ATC AAT TAC ACC ATA TTC 865 
Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He Asn Tyr Thr He Phe 

275 280 285 

AAA GTC AGG ATG TAC GTG GGA GGG GTC GAG CAC AGG CTG GAA GCG GCC 913 
2g Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala 

290 295 300 

TGC AAC TGG ACG CGG GGC GAA CGC TGT GAT CTG GAA GAC AGG GAC AGG 961 
30 Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 

305 310 315 320 

TCC GAG CTC AGC CCA TTG CTG CTG TCC ACC ACA CAG TGG CAG GTC CTT 1009 
Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gin Trp Gin Val Leu 

325 330 335 

CCG TGT TCT TTC ACG ACC CTG CCA GCC TTG TCC ACC GGC CTC ATC CAC 1057 
Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 

340 345 350 

CTC CAC CAG AAC ATT GTG GAC GTG CAG TAC TTG TAC GGG GTA GGG TCA 1105 
Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser 

355 360 365 

AGC ATC GCG TCC TGG GCC ATT AAG TGG GAG TAC GTC GTT CTC CTG TTC 1153 
Ser He Ala Ser Trp Ala He Lys Trp Glu Tyr Val Val Leu Leu Phe 
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370 375 380 

CTT CTG CTT GCA GAC GCG CGC GTC TGC TCC TGC TTG TGG ATG ATG TTA 1201 
Leu Leu Leu Ala Asp Ala Arg Val Cys Ser Cys Leu Trp Met Met Leu 
385 390 395 400 

CTC ATA 1207 
Leu lie 

(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



(iv) ANTI -SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: H90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

G ATC CCA CAA GCC ATC ATG GAT ATG ATC GCT GGT GCT CAC TGG GGA GTC 49 
lie Pro Gin Ala He Met Asp Met He Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC ATA GCG TAT TTC TCC ATG GTA GGG AAC TGG GCG AAG GTC 97 
Leu Ala Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp Ala Lys Val 
20 25 30 
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CTA GTA 
Leu Val 

ACC GGG 
Thr Gly 
50 

ACA CGA 
Thr Arg 

65 
TGG CAC 
Trp His 

GGC TGG 
Gly Trp 

TGT CCT 
Cys Pro 

GGC TGG 
Gly Trp 
130 
CCC TAG 
Pro Tyr 
145 

AAG AGC 
Lys Ser 

GTG GGA 
Val Gly 



GTG CTG 
Val Leu 

35 
GGA AGT 
Gly Ser 

GGC CCC 
Gly Pro 

ATC AAT 
lie Asn 

ATA GCG 
He Ala 
100 
GAG AGG 
Glu Arg 
115 

GGC CCT 
Gly Pro 

TGC TGG 
Cys Trp 

GTG TGT 
Val Cys 

ACG ACC 
Thr Thr 



CTG CTA TTT 
Leu Leu Phe 

GCC GGC CGC 
Ala Gly Arg 
55 

AAG CAG AAC 
Lys Gin Asn 
70 

AGC ACG GCC 
Ser Thr Ala 
85 

GGG CTC TTC 
Gly Leu Phe 

TTG GCC AGC 
Leu Ala Ser 

ATC AGT TAT 
He Ser Tyr 
135 

CAC TAC CCC 
His Tyr Pro 

150 
GGC CCG GTA 
Gly Pro Val 
165 

GAC AGG TCG 
Asp Arg Ser 



GCC GGC 
Ala Gly 

40 
TCC GTG 
Ser Val 

ATC CAG 
He Gin 

CTG AAC 
Leu Asn 

TAT CAC 
Tyr His 
105 
TGC CGA 
Cys Arg 
120 

GCC AAC 
Ala Asn 

CCA AGA 
Pro Arg 

TAC TGC 
Tyr Cys 

GGC GCG 
Gly Ala 



GTC GAC 
Val Asp 

CTT GGG 
Leu Gly 

CTG ATC 
Leu He 
75 

TGC AAT 
Cys Asn 

90 
CAT GGA 
His Gly 

CGC CTT 
Arg Leu 

GGA AGC 
Gly Ser 

CCT TGT 
Pro Cys 
155 
TTC ACT 
Phe Thr 
170 

CCT ACC 
Pro Thr 



GCG GAA 
Ala Glu 
45 

ATT GCT 
He Ala 

60 
AAA ACC 
Lys Thr 

GAC AGC 
Asp Ser 

TTC AAC 
Phe Asn 

ACC GAT 
Thr Asp 
125 
GGC CCC 
Gly Pro 
140 

GGC ATT 
Gly He 

CCC AGC 
Pro Ser 

TAC AAC 
Tyr Asn 



ACC CAC GTC 
Thr His Val 

AGT TTC CTT 
Ser Phe Leu 

AAC GGC AGT 
Asn Gly Ser 
80 

CTT AAC GCC 
Leu Asn Ala 
95 

TCT TCA GGC 
Ser Ser Gly 
110 

TTT GAC CAG 
Phe Asp Gin 

GAC GAA CGT 
Asp Glu Arg 

GTG CCC GCA 
Val Pro Ala 
160 

CCC GTG GTG 
Pro Val Val 

175 
TGG GGT GAA 
Trp Gly Glu 
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180 185 190 

AAT GAT ACG GAT GTC CTC ATC CTT AAC AAC ACC AGG CCG CCG CTG GGC 625 
Asn Asp Thr Asp Val Leu lie Leu Asn Asn Thr Arg Pro Pro Leu Gly 

195 200 205 

AAT TGG TTC GGT TGT ACC TGG ATG AAC TCA ACT GGA TTC ACC AAA GTG 673 
Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Val 

210 215 220 

TGC GGA GCG CCC CCT TGT GTC ATC GGA GGG GTG GGC AAC AAC ACC TTG 721 
Cys Gly Ala Pro Pro Cys Val lie Gly Gly Val Gly Asn Asn Thr Leu 
225 230 235 240 

CGC TGC CCC ACT GAT TGT TTC CGC AAG CAT CCG GAA GCC ACA TAC TCT 769 
Arg Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ser 

245 250 255 

CGG TGC GGC TCC GGT CCC TGG ATC ACA CCC AGG TGC ATG GTC CAC TAC 817 
Arg Cys Gly Ser Gly Pro Trp He Thr Pro Arg Cys Met Val His Tyr 

260 265 270 

CCG TAT AGG CTT TGG CAC TAT CCT TGT ACC ATC AAT TAC ACT ATA TTT 865 
Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He Asn Tyr Thr He Phe 

275 280 285 

AAA GTC AGG ATG TAC GTG GGA GGG ATC GAG CAC AGG CTG GAA GCG GCC 913 
Lys Val Arg Met Tyr Val Gly Gly He Glu His Arg Leu Glu Ala Ala 

290 295 300 

TGC AAC TGG ACG CGG GGC GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG 961 
Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

TCC GAG CTC AGC CCA TTG CTG CTG TCC ACT ACG CAG TGG CAG GTC CTT 1009 
Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gin Trp Gin Val Leu 

325 330 335 

CCG TGT TCT TTC ACG ACC CTG CCA GCC TTG TCC ACC GGC CTC ATC CAC 1057 
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Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu lie His 

340 345 350 

CTC CAC CAG AAC ATT GTG GAC GTG CAG TAC TTG TAC GGG GTA GGG TCA 1105 
Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser 

355 360 365 

AGC ATC GCG TCC TGG ACC ATC AAG TGG GAG TAC GTC GTT CTC CTG TTC 1153 
Ser He Ala Ser Trp Thr He Lys Trp Glu Tyr Val Val Leu Leu Phe 

370 375 380 

CTC CTG CTT GCA GAC GCG CGC GTC TGC TCC TGC TTG TGG ATG ATG TTA 1201 
Leu Len Leu Ala Asp Ala Arg Val Cys Ser Cys Leu Trp Met Met Leu 
385 390 395 400 

CTC ATA 1207 
Leu He 

(2) INFORMATION FOR SEQ ID NO: 10: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 
^5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(iv) ANTI-SENSE: No 

(vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: Jl(JM) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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G ATC CCA CAA GCC ATC TTG GAT ATG ATC GCT GGT GCT CAC TGG GGA GTC 49 
lie Pro Gin Ala He Leu Asp Met He Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC ATA GCG TAT TTC TCC ATG GTG GGG AAC TGG GCG AAG GTC 97 
Leu Ala Gly He Ala Tyr Phe Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

CTG GTA GTG CTG TTG CTG TTT GCC GGC GTC GAC GCG GAA ACC ATC GTC 145 
Leu Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr He Val 

35 40 45 

TCC GGG GGA CAA GCC GCC CGC GCC ATG TCT GGA CTT GTT AGT CTC TTC 193 
Ser Gly Gly Gin Ala Ala Arg Ala Met Ser Gly Leu Val Ser Leu Phe 

50 55 60 

ACA CCA GGC GCT AAG CAG AAC ATC CAG CTG ATC AAC ACC AAC GGC AGT 241 
Thr Pro Gly Ala Lys Gin Asn He Gin Leu He Asn Thr Asn Gly Ser 
65 70 75 80 

TGG CAC ATC AAT AGC ACG GCC TTG AAC TGC AAT GAA AGC CTT AAC ACC 289 
Trp His He Asn Ser Thr Ala Leu Asn Cys Asn Glu Ser Leu Asn Thr 

85 90 95 

GGC TGG TTA GCA GGG CTT ATC TAT CAA CAC AAA TTC AAC TCT TCG GGC 337 
Gly Trp Leu Ala Gly Leu He Tyr Gin His Lys Phe Asn Ser Ser Gly 

100 105 110 

TGT CCC GAG AGG TTG GCC AGC TGC CGA CGC CTT ACC GAT TTT GAC CAG 385 
Cys Pro Glu Arg Leu Ala Ser Cys Arg Arg Leu Thr Asp Phe Asp Gin 

115 120 125 

GGC TGG GGC CCT ATC AGT CAT GCC AAC GGA AGC GGC CCC GAC CAA CGC 433 
Gly Trp Gly Pro He Ser His Ala Asn Gly Ser Gly Pro Asp Gin Arg 

130 135 140 

CCC TAT TGT TGG CAC TAC CCC CCA AAA CCT TGC GGT ATC GTG CCC GCA 481 
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Pro Tyr Cys Trp His Tyr Pro Pro Lys 
145 150 
- 5 AAG AGC GTA TGT GGC CCG GTA TAT TGC 

Lys Ser Val Cys Gly Pro Val Tyr Cys 
165 

10 

(2) INFORMATION FOR SEQ ID NO: 11: 

75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

20 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(iv) ANTI-SENSE: No 



Pro Cys Gly II Val Pro Ala 
155 160 
TTC ACT CCC AGC CCC 523 
Phe Thr Pro Ser Pro 
170 



30 (vi) ORIGINAL SOURCE: 

(A) ORIGIN: Hepatitis C virus 

(B) CLONE: J4(JM) 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAC TGG GGA GTC 49 
lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

CTG GCG GGC CTT GCC TAC TAT TCC ATG GTA GGG AAC TGG GCT AAG GTC 97 
Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

20 25 30 

CTG ATT GTG GCG CTA CTC TTC GCC GGC GTT GAC GGG GAG ACC TAC ACG 145 
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Leu lie Val Ala Leu Leu Phe Ala Gly Val Asp Gly Glu Thr Tyr Thr 

35 40 45 

TCG GGG GGG GCG GCC AGC CAC ACC ACC TCC ACG CTC GCG TCC CTC TTC 193 
Ser Gly Gly Ala Ala Ser His Thr Thr Ser Thr Leu Ala Ser Leu Phe 

50 55 60 

TCA CCT GGG GCG TCT CAG AGA ATC CAG CTT GTG AAT ACC AAC GGC AGC 241 
Ser Pro Gly Ala Ser Gin Arg lie Gin Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

TGG CAC ATC AAC AGG ACT GCC CTA AAC TGC AAT GAC TCC CTC CAC ACT 289 
Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu His Thr 

85 90 95 

GGG TTC CTT GCC GCG CTG TTC TAG ACA CAC AGG TTC AAC TCG TCC GGG 337 
Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Arg Phe Asn Ser Ser Gly 

100 105 110 

TGC CCG GAG CGC ATG GCC AGC TGC CGC CCC ATT GAC TGG TTC GCC CAG 385 
Cys Pro Glu Arg Met Ala Ser Cys Arg Pro tie Asp Trp Phe Ala Gin 
30 115 120 125 

GGA TGG GGC CCC ATC ACC TAT ACT GAG CCT GAC AGC CCG GAT CAG AGG 433 
Gly Trp Gly Pro He Thr Tyr Thr Glu Pro Asp Ser Pro Asp Gin Arg 
35 130 135 140 

CCT TAT TGC TGG CAT TAC GCG CCT CGA CCG TGT GGT ATC GTA CCC GCG 481 
Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

TCG CAG GTG TGT GGT CCA GTG TAT TGC TTC ACC CCA AGC CCT 523 
Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro 
165 170 
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(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 
5 (B) TYPE: protein 

(vi) ORIGINAL SOURCE: 
10 (A) ORIGIN: Hepatitis C virus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

75 





lie Pro Gin 


Ala 


Val 


Val 


Asp 


Met 


Val 


Ala 


Gly Ala His Trp Gly Val 








He 


Met 






He 






20 


























Leu 














1 




5 










10 


15 




Leu Ala Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met 


Val 


Gly Asn Trp Ala Lys Val 


25 
























He 






Phe 






Ala 








20 










25 




30 




Leu Val Val 


Met 


Leu 


Leu 


Phe 


Ala 


Gly 


Val 


Asp Gly Gly Thr His Val 


30 






















He 


Ala 














Ala Arg Arg Thr 






Leu 














His Tyr Arg 


35 


















Asp He 




















Gin Gin 




















Glu His 


40 


















Thr Thr 




35 










40 






45 




Thr Gly Gly 


Lys 


Val 


Ala 


Tyr 


Thr 


Thr 


Gin 


Gly Phe Thr Pro Phe Phe 


45 


Ser Val Ala 


Val 


Gin 


Gly 


His 


Val 


Val 


Ser 


Arg Leu Val Ser Leu Leu 




Met 


Ala 


Ala 


Ser 


Lys 


Ser 


Met 


Asn 


Ser Val Ala Arg Met 






Arg 






Ser 


Ala 


Ala 


Thr 


Thr He Gly Trp 


50 




Ser 






Arg 


Arg 




His 


Ala 
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Gin 
His 
Asn 
He 



Phe Gly Leu 
His Tyr 
Ala 



50 55 60 

Ser Arg Gly Pro Ser Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 
Arg Pro Ala Gin Glu Arg Val He Lys 

Thr Ser Ser Lys Asn 
Ala Gin Ala Asp 

Asn Ala Arg 
Leu 

65 70 75 80 

Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr 
Leu Ser Glu Gin Ala 

Lys 
His 

85 90 95 

Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Ser Phe Asn Ala Ser Gly 
Trp He Thr He Arg Lys Asp Ser Arg 
Gly Ala Arg 

His Gly 
Gin 

100 105 110 

Cys Pro Glu Arg Met Ala Gly Cys Arg Pro He Asp Glu Phe Ala Gin 
Ala Glu Ser Val Leu Ser Cys Ser Leu Ser Lys Asp 

Gin Gin Thr Thr Trp 
Phe Asp 
Arg Thr 
115 120 125 
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Gly Trp Gly Pro He Thr His Val Val Pro Asn He 
Asp Ser Tyr Ala Gin Ser Asp Val 

Asp Glu Arg Ser Asn 
Thr Met Gly Glu Arg 
Asn Asn Gin Arg Ser 
Lys Gly Gly 
Thr 

130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly 

Pro Pro Gin Thr 
Lys 

145 150 155 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro 
Trp Glu 
Lys Ser 

165 170 
Val Gly Thr Thr Asp Arg Phe Gly Ala Pro Thr Tyr 

Ser Val 



Ser Asp Gin Arg 
Pro Glu Glu Lys 
Thr 
Gly 



He Val Pro Ala 
Val 

160 

Ser Pro Val Val 



50 



180 185 
Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg 
Asp Phe Val Ser 

He 

195 200 
Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly 

Ser 

210 215 220 

Cys Gly Gly Pro Pro Cys Asn He Gly Gly Val Gly 



175 

Asn Trp Gly Asn 
Thr Ala 
Arg Asp 
Ser Glu 
190 

Pro Pro Gin Gly 
His 
Leu 

205 

Phe Thr Lys Thr 
Val 

Asn Asn Thr Leu 
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Ala His Arg Arg 

Val Ala 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Thr 
Val Asp Ser 

His 
Leu 
Arg 

245 250 255 

Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Val His Tyr 
Arg He Met Asp 

260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 

He Tyr Val 
Thr 

275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala 
He He Asp 

Asn 
He 

290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 

Asp Asn 

305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Val Leu 
Ala Thr Gin He 

Pro 

325 330 335 

Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His 
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340 345 350 

Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly lie Gly Ser 
Arg Val Val 
355 360 365 

Ala Val Val Ser Phe Ala lie Lys Trp Glu Tyr lie Leu Leu Leu Phe 
Ser He Ala He Val Arg Val Val 

Trp Thr 

370 375 380 

Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met Met Leu 

Ser 

385 390 395 400 

Leu He 



25 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

35 

(D) TOPOLOGY: linear 



(ii) MOLECULAR TYPE: the other nucleic acid (synthesized DNA for 

PCR) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 



GCTATCAGCA GCATCATCCA 20 



(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: the Other nucleic acid (synthesized DNA for 

PCR) 

( ) SEQUENCE CHARACTERISTIC: N represents inosine, 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 



25 CAGNTANTCC GGATCCCNCA AG 22 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULAR TYPE: the other nucleic acid (synthesized DNA for 

PCR) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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GTAAAACGAC GGCCAGT 17 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULAR TYPE: the Other nucleic acid (synthesized DNA for 



PGR) 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 



CAGGAAACAG CTATGAC 17 



(2) INFORMATION FOR SEQ ID NO: 17: 



^® (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCR) 



(ii) MOLECULAR TYPE: the Other nucleic acid (synthesized DNA for 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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GGACTAGTCC 10 
(2) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

;5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULAR TYPE: the other nucleic acid (synthesized DNA for 

PGR) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CTAGAGAATT CGGTAC 16 
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Claims 

35 1. A diagnostic reagent for hepatitis C, which detects an antibody induced by infection of hepatitis C virus, 
characterized in that said diagnostic reagent comprises the second envelope protein or the first non- 
structural protein which is encoded by the gene of hepatitis C virus and has a suger chain. 

2. The diagnostic reagent for hepatitis 0 according to Claim 1, wherein the second envelope protein or the 
40 first non -structural protein is represented by an amino acid sequence selected from SEQUENCE ID 

Nos.1-12 of SEQUENCE LISTING or an amino acid sequence having the same as a part thereof. 

3. The diagnostic reagent for hepatitis 0 according to Claim 1, wherein the second envelope protein or the 
first non-structural protein is encoded by a base sequence selected from SEQUENCE ID Nos.1-2, and 

45 4-11 of SEQUENCE LISTING. 

4. The diagnostic reagent for hepatitis C according to Claim 1 , wherein the second envelope protein or the 
first non-structural protein is produced by an animal cell. 

50 5. The diagnostic reagent for hepatitis C according to Claim 4. wherein the animal cell is CHO cell. 

6. A method for detecting an anti-hepatitis C virus antibody, wherein the second envelope protein or the 
first non-structural protein which is encoded by th gene of hepatitis C virus and has a suger chain is 
used as an antigen to detect the antibody specific to said antigen. 



55 



The method according to Claim 6. wherein the s cond envelop protein or the first non-structural 
protein is repr sented by an amino acid s quenc s I cted from SEQUENCE ID Nos.1-12 of 
SEQUENCE LISTING or an amino acid sequence having the same as a part thereof. 
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8. The method according to Claim 6, wherein the second envelope protein or the first non-structural 
protein is encoded by a base sequence selected from SEQUENCE ID Nos.1-2, and 4-11 of SE- 
QUENCE LISTING. 

5 9. The method according to Claim 9, wherein the second envelope protein or the first non-structural 
protein is produced by an animal cell. 

10. The method according to Claim 6, wherein the animal cell is CHO cell. 

10 11. A method for detecting an anti-hepatitis C virus antibody, which comprises the steps of contacting a 
sample with the second envelope protein or the first non-structural protein which is encoded by the 
gene of hepatitis C virus and has a suger chain under the conditions that the second envelope protein 
or the first non-structural protein Is bound to the anti-hepatitis C virus antibody to form an im- 
munological complex and measuring the formation of the immunological complex to confirm the 

75 presence of the anti-hepatitis C virus antibody in the sample. 

12. The method according to Claini 11, wherein the formation of the immunological complex is measured 
by RIA, ELISA, fluorescent antibody technique, agglutination reaction, or Immune precipitation. 
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